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[57] ABSTRACT 

The invention relates to a method and apparatus for main- 
taining data/parity consistency in a RAID data storage 
system. The invention utilizes reserved disk storage space in 
the RAID array to log data necessary to restore data/parity 
consistency should an interruption event, such as a power 
failure, corrupt the data stored in a particular redundancy 
group. In one embodiment, the invention logs new data and 
new parity information to the reserved disk storage space 
before the new data and new parity information are written 
to the appropriate locations in the associated redundancy 
group. In this way, if an interruption event occurs when 
either the new data or the new parity information has already 
been written to the redundancy group but the other has not, 
the corresponding data and parity information stored in the 
reserved disk storage space can be used to restore data/parity 
consistency after the event has ended. 

21 Claims, 3 Drawing Sheets 
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PRESERVATION OF DATA INTEGRITY IN A 
RAID STORAGE DEVICE 

This is a continuation of application Scr. No. 08/549,292 
filed on Oct. 27, 1995, now abandoned. 

HELD OF THE INVENTION 

The invention relates, in general, to magnetic data storage 
systems and, in particular, to a method and apparatus for 
maintaining consistency between data and parity informa- 
tion in a disk array subsystem operating in degraded mode. 

BACKGROUND OF THE INVENTION 

Magnetic disk storage is currently the most widely used 
method of mass storage for computer systems. Traditionally, 
systems using this method of mass storage have included a 
single disk capable of storing large amounts of data. 
However, systems using an array of smaller capacity, less 
expensive disk drives are currently emerging as a low cost 
alternative to large single disk systems. These array systems 
are known as RAID (redundant array of independent drives) 
systems. 

When used in conjunction with a host computer, a RAID 
system appears to behave just like a single disk system. 
RAID systems, however, offer many advantages over single 
disk systems. One of the most important advantages of 
RAID technology is the greatly improved system reliability 
it provides. Reliability is improved through the use of 
redundancy information in the array which allows the sys- 
tem to continue operating, in a degraded mode, even though 
one of the drives in the array has failed. The failed drive may 
then be replaced, and the lost data regenerated, without 
having to shut down the system. This is a great improvement 
over a single disk system which is rendered inoperable and 
may lose valuable data if the one disk in the system fails. 

RAID technology encompasses a series of techniques for 
managing the operation of multiple disks. These techniques 
are discussed in an article entitled "A Case for Redundant 
Arrays of Inexpensive Disks (RAID)" by Patterson, Gibson, 
and Katz of the University of California (Report No. UCB/ 
CSD 87/391, December 1987) which categorizes the differ- 
ent techniques into five RAID "levels" and is hereby incor- 
porated by reference. Each RAID level represents a different 
approach to storing and retrieving data and the associated 
redundancy information across the array of disk drives. 

For example, FIG. 1 illustrates one embodiment of a 
RAID level 5 data storage system 10. As seen in the figure, 
the system 10 comprises: an array of disk drives 16, iden- 
tified as DRIVE A through DRIVE E; a disk array controller 
12; and a host computer 14. The host computer 14 delivers 
I/O requests to the disk array controller 12 requesting that 
certain read/write operations be performed. The controller 
12 coordinates the transfer of data between the host com- 
puter 14 and the array of disk drives 16 according to RAID 
level 5 techniques in response to those requests. In addition, 
the controller 12 calculates and stores the required redun- 
dancy information, which in a RAID level 5 system com- 
prises parity information. The parity information is simply a 
collection of parity bits which are binary digits used to make 
the sum of all the digits across each redundancy group either 
an even or odd number. 

Blocks 20 through 24 in FIG. 1 illustrate the manner in 
which data and parity information are stored on the five 
array drives in system 10. Data is stored in data blocks 
identified as BLOCK 0 through BLOCK 15. Parity infor- 
mation is stored in parity blocks identified as PARITY 0 
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through PARITY 3. Each parity block is associated with four 
corresponding data blocks, all bcated on a common "stripe" 
across the five array drives, to form a redundancy group. The 
parity information stored in the parity block of any particular 
5 redundancy group is calculated using the data stored in the 
four corresponding data blocks. Consequently, if the data 
stored in one of the data blocks of a redundancy group is 
changed, the corresponding parity information must be 
updated. 

10 Because there is a direct relationship between the data 
stored in a redundancy group and the corresponding parity 
information, if some of the data in the group is lost, such as 
by the failure of one of the disk drives in the array, the parity 
information may be used to reconstruct the lost data. In this 

35 way, the system 10 can continue to perform read and write 
operations even before the failed drive is replaced. It should 
be apparent, however, that in order for the system 10 to 
maintain the increased reliability provided by the above- 
described technique, it is mandatory that the system main- 

20 tain consistency between the data and the parity information 
stored in each of its redundancy groups. 

A problem can arise when the system 10 is performing a 
write operation. The problem stems from the fact that, 
during a write operation, new data and new parity informa- 
tion are normally written to the redimdancy groups at 
different times. Therefore, if a system interruption, such as 
a loss of system power, occurs during the write operation, a 
condition may result where either the new data or the new 
parity information has been written to the redundancy group 
without the other. This creates an inconsistency between 
data and parity within the array 16 which can, in certain 
circumstances, negatively effect the system's ability to oper- 
ate properly. For example, if one of the drives in the array 
fails, it will be impossible to read the data block correspond- 
ing to the failed drive in a redundancy group containing 
inconsistent data/parity information. In addition, a retry of 
the write operation interrupted during the power failure will 
not correct the inconsistency in the redundancy group. 

Therefore, a need exists for a method and apparatus for 
preserving the data/parity consistency in a RAID system. 

SUMMARY OF THE mVE>rnON 

~ The invention relates to a method and apparatus for 
45 maintaining data/parity consistenc^y in a RAID data storage 
^ system. The invention utilizes reserved disk storage space in 
^ the RAID array to log data necessary to restore data/parity 
> consistency shoiiTd an inierruption event, such as a power 
failure,^ corrupt the data stored in a particular redundancy 
50 group. In one embodiment, the invention logs new data and 
new parity information to the reserved disk storage space 
before the new data and new parity information are written 
to the appropriate locations in the associated redundancy 
group. In this way, if an interruption event occurs when 
55 either the new data or the new parity information has already 
been written to the redundancy group but the other has not, 
the corresponding data and parity information stored in the 
reserved disk storage space can be used to restore data/parity 
consistency after the event has ended. The invention is of 
60 particular value in RAID systems which use the read- 
modify- write method to update parity information. 

In one aspect of the present invention, a process is 
provided for use in a method for restoring consistency 
between data and parity information stored in redundancy 
65 groups in a disk array subsystem following an event which 
interrupts the execution of one or more write operations. 
More specifically, the process comprises: (a) reserving log 
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Space on at least one of the disk drives in the array of disk data to calculate new redundancy information for the first 

drives; (b) receiving a write request from a host system redundancy group; (d) storing information in the reserved 

requesting that new data be written to a first data storage area log storage space capable of restoring consistency between 

in a predetermined redundancy group within the array of data and redundancy information stored in the first rcdun- 

disk drives; (c) reading old data from the first data storage 5 dancy group should the first redundancy group become 

area in the predetermined redundancy group and old parity corrupted; (c) storing the new data and the new redundancy 

information from a first parity storage area in the predeter- information in the data portion and the redundancy portion, 

mined redundancy group; (d) using the old data, the old respectively, of the first redundancy group, after the step of 

parity information, and the new data to calculate new parity storing informaUon; and (f) using the information stored in 

information; (e) first writing the new daU and the new parity reserved log storage space to restore consistency 

information to the log space; and (f) second writing, after the between data and redundancy information in the first redun- 

step of first writing, the new data and the new parity dancy group after the occurrence of an interruption event 

information to the first data storage area and the first parity ^tj-^^jj interrupts the operation of the system during the step 

storage area, respectively, in the predetermined redxmdancy Qf second storing. 

S'^^^P' 15 Iq one embodiment, the information stored in the reserved 

In one embodiment of the present mvenUon, the new data ^^^^^^^ ^p^^ includes old data and old redundancy 

and new parity information stored m the reserved log space infonnation from the first redundancy group. In another 

is used to restore consistency in a redundancy group after the embodiment, the information stored in the reserved log 

occurrence of an interruption event. To accomplish this, the ^^^^^g^ ^p^^^ includes the new data and the new redundancy 

new data and new parity information can be simply trans- 20 information 

ferred to proper locations within the redundancy group. The , . * , , 

^ , • 1 J 1 ° A In a last aspect of the present invention, a disk array 

intermplion event may mclude: a low power condition, a . ^ ui f • * • - * u * a / 

device reset, a controller device failure, or other similar subsystem capable of maantammg consistency between data 

event. In another embodiment of the present invention, the and redundancy ^formation stored m redundancy groijps 

disk array subsystem incliides a RAID device operating in ^^'hi" f'^y "^"^^ .s provided. More ^eafically, 

degraded mode la this embodiment, the first data storage 'h^^bsystem oompnses: (a) a loggmg area witbm the array 

^ J J J . . i_ *i_ • J of disk drives; (b) means for coordinating the performance 

area and the second data storage area may both reside on "^^f^ ^ly^ay ^^uJ iii^ou^ . • ^ ,u 

. . . . J ,u u A- of wnte operations to the array of disk drives, the means 

survivor dnves m the array. In yet another embodunent, data / • a . c 

^, J * • J J ^ -1 capable of receivmg new data from a host computer to be 

identifying the predetermined redundancy group is also . , ■ , , a . - a a a • *u 

J - . L At 'ru- 11 ^L^^u^^t^ delivered to a predetermmed redundancy group m the array 

stored in the reserved log space. This allows a controller to 30 r-,,,. /v r ii.- a a 

J , , , A . A •* ' c of disk dnves; (c) means for calculating new redundancy 

determmc where the new data and the new panty intorma- . ^ - r \ ^ . • ^ a a 

tion stored in the reserved log space should be transferred if ^nformation for he predetermmed redundancy group using 

■ , . the new data: (dl means for dehvermg the new data and the 

an mterruption event occurs. j j • t . .l . • • .1. 

f , , ... . r new redundancy information to the logging area in the array 

In another aspect of the present invenUon, a method for ^^^^^ ^^^^ delivering the new data and the 

restormg consistency between data and redundancy mfor- 35 redundancy information to the predetermined redun- 

mation m a RAID data storage system havmg an array of ^ ^^^.^^^^^ 

disk dnves, wherein the array of disk drives includes a ^^^^^^ ^ ^^^^ ^ 

plurahty of redundancy groups each mcluding a data Portion ^j^^ information stored in the logging area to restore 

and a redundancy mforaiation portion, is provided. More ^onsltency between data and redundancy information in the 

specifically, the method comprises: (a) reserving log storage 40 predetermined redundancy group after the occurrence of an 

space in the array of disk dnves; (b) receivmg new data to j;terraption event which interrupts the performance of a 

be stored in the data portion of a fitst redundancy group in ,„„esponding write operation, 
the array of disk dnves in accordance with a predetermmed 

RAID technique, from a host computer; (c) using the new BRIEF DESCRIPTION OF THE DRAWINGS 

data to calculate new redundancy information for the first 45 _ ^ .„ . , „ . , , ^ , 

redundancy group; (d) first storing the new data and the new F^G. 1 illustrates a typical RAID level 5 data storage 

redundancy information in the reserved log storage space; system; 

(e) second storing the new data and the new redundancy FIG. 2 illustrates a data configuration for the array of disk 

information in the data portion and the redundancy portion, drives in one embodiment of the present invention; and 

respectively, of the first redundancy group, after the step of 50 FIG. 3 illustrates a data configuration for the log record of 

first storing; and (e) using the new data and the new FIG. 2 in one embodiment of the present invention, 

redundancy infonnation stored within the reserved log stor- hftatt Pn nv^m IPTTOM 

age space to restore consistency between data and redun- UL.i/ML.nu uiiOL,niriiuix 

dancy information in the first redundancy group after the The invention relates to a method and apparatus for 

occurrence of an interruption event which interrupts the 55 maintaining data/parity consistency in a RAID data storage 

operation of the system during the step of second storing. system. The invention utilizes reserved disk storage space in 

In a third aspect of the present invention, a method is the RAID array to log data necessary to restore data/parity 

provided for restoring consistency between data and redun- consistency should an interruption event, such as a power 

dancy information in a RAID data storage system having an failure, corrupt the data stored in a particular redundancy 

array of disk drives, wherein the anay of disk drives 60 group. In one embodiment, the invention logs new data and 

includes a plurality of redundancy groups each including a new parity information to the reserved disk storage space 

data portion and a redundancy information portion, the before the new data and new parity information are written 

method comprising the steps of: (a) reserving log storage to the appropriate locations in the associated redundancy 

space in the array of disk drives; (b) receiving new data to group. In this way, if an interruption event occurs when 

be stored in the data portion of a first redundancy group in 65 either the new data or the new parity information has already 

the array of disk drives in accordance with a predetermined been written to the redundancy group but the other has not, 

RAID technique, from a host computer; (c) using the new the conesponding data and parity information stored in the 
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reserved disk storage space can be used to restore data/parity 
consistency after the event has ended. The invention is of 
particular value in RAID systems which use the read- 
modify-writc method to update parity information. 
; The invention will be described as implemented in "at 5 
r R^©»^lev^r5 clata storage ^system. ll shbuld^ b apprecialed>- 
f hp.wrcyer^ythat the invention ^may ,bc,4inplemea|o^^ any^. 
RAID system where data> and redun dancy info nnationMisV ~ 
vmttcn^g^ te an^ ^Ld^ 
y^j'steSf^*^^ 

■ the artay lo mtfr iM^^ f^^- 1 

illustrates one method of achieving this data/parity distri- 
bution. In FIG. 1, each parity block stored in the array 
corresponds to 4 data blocks to form a redundancy group, 
i.e., PARITY 0 corresponds to BLOCKs 0, 1, 2, and 3 to 
form a first redundancy group, PARITY 1 corresponds to 
BLOCKS 4, 5, 6, and 7 to form a second redundancy group, 
etc. The parity blocks are "staggered" among the drives so 
that each drive contains some parity information. 

The parity information stored in the parity block of any 
particular redundancy group is calculated using the data 
stored in the 4 corresponding data blocks. The most common 
method of calculating the parity information is by perform- 
ing a bit wise exclusive-OR (XOR) operation on the data 
stored in the blocks. Because the parity information stored 
in the parity block of a redundancy group depends upon the 
data stored in the corresponding data blocks, the parity 
information must be updated if the data stored in any of the 
corresponding data blocks is changed during, for example, a 
write operation. 

In a system which utilizes the bit wise XOR operation to 
calculate the parity information, the relationship between the 
parity and data blocks is as follows: 
PARITY 0-(BLOCK 0) XOR (BLOCK 1) XOR (BLOCK 

2) XOR (BLOCK 3) PARITY IKBLOCK 4) XOR 

(BLOCK 5) XOR (BLOCK 6) XOR (BLOCK 7) PARITY 

2=(BL0CK 8) XOR (BLOCK 9) XOR (BLOCK 10) 

XOR (BLOCK U) 
PARITY 3=(BL0CK 12) XOR (BLOCK 13) XOR (BLOCK 

14) XOR (BLOCK 15) 
Therefore, when the data stored within one of the data 
blocks of a redundancy group is changed, such as during a 
write operation, the new parity information for the redun- 
dancy group can be calculated by reading the contents of all 
of the other data blocks within the group and XORing that 
data with the new data. However, in an altcmative method, 
known as the read-modify -write method, calculation of the 
new parity information can be accomplished in a more 
efiScient fashion by simply reading the old data from the data 
block being changed and the old parity information from the 
corresponding parity block and applying the following equa- 
tion: 

new parity-(old data XOR old parity) XOR new data. 

The read-modify -write method is advantageous in that only 
the data and parity drives which will be updated need to be 
accessed during the write operation, whereas all the drives 
in the array will have to be read or accessed to perform the 
previously described method. 

Drive utilization efficiency may be improved by modify- 
ing the read- modify -write process, separating the execution 
of data read and write operations from the execution of 
parity read, generation and write operations. This modified 
read-modify-write operation identifies the disk drives con- 
taining the data and parity to be updated and places the 
proper read and write requests into the I/O queues for the 
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identified data and parity drives, scheduling some or all 
parity operations (i.e., reading old parity information from 
the parity drive, generating new parity infonnalion, and 
writing the new parity information lo the parity drive) for 
execution when best accommodated in the I/O queue for the 
parity drive, following the read of old data from the data 
'drive. 

When using any of the above-described methods for 
updating the parity information in a redtmdancy group, 
actual write transfers of new data and new parity to the 
associated blocks in the redundancy group need not occur at 
the same time. If either the new data or the new parity is 
written prior to a system interruption or failure, but the other 
is not, the contents of the redundancy group will be 
15 inconsistent, i.e., the parity information will not be in 
agreement with the data stored within the redundancy group. 
A subsequent retry of the write operation interrupted during 
the system failure wiU not correct the inconsistencies in the 
redundancy group. 
20 In accordance with the present invention, storage space is 
reserved somewhere in the array of disk drives 16 for the 
logging of information necessary to restore consistency 
between the data and parity information of redundancy 
groups which were undergoing a write operation during a 
25 system mterruption. FIGS. 2 and 3 illustrate one method of 
storing information in the array of disk drives in accordance 
with the present invention. As seen in FIG. 2, a logging area 
30 is reserved on each disk in the array. The logging area 30 
is comprised of a plurality of log records 32, each record 
30 corresponding \o a particular redundancy group which may 
contain corrupted data. In one approach, the log records 
stored on a particular disk correspond to redundancy groups 
for which the parity block is stored on the same disk. In 
another approach, the log records stored on a particular disk 
35 correspond to redtmdancy groups for which the data block 
which was the subject of the interrupted write operation is 
stored on the same disk. In yet another approach, there is no 
correspondence between the location of a log record and the 
associated redundancy group. 
40 FIG. 3 illustrates the configuration of a log record 32 in 
one embodiment of the present invention. As seen in the 
figure, the log record 32 includes a new data portion 34, a 
new parity portion 36, and a meta-data portion 38. In this 
embodiment of the invention, the disk array controller 12, 
45 after receiving the new data and calculating the new parity 
information corresponding to a particular write request, 
writes the new data, the new parity information, and data 
indicative of the location of the associated data block and 
parity block in the array to the new data portion 34, the new 
50 parity portion 36, and the meta-data portion 38 of the log 
record 32 all in a single I/O operation. 

A description of the operation of the above-described 
embodiment of the present invention will now be made. 
During normal operation, all of the disk drives in the array 
55 16 will be operative. The disk array controller 12 receives a 
write request from the host computer 14 requesting the 
controller 12 to store specific data to a certain location in the 
array 16. The controller 12 determines which data block in 
the array 16 to write the new data to and proceeds to read the 
60 data stored in that data block and the parity information 
stored in the associated parity block. The controller 12 then 
uses this information, along with the new data received from 
the host computer 14, to calculate the new parity informa- 
tion. 

65 After the new parity information has been calculated, the 
controller transfers the new data and the new parity infor- 
mation to the new data section 34 and the new parity section 
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36 of ihe next available log record 32 in the logging area 30 some of Ihe informalion in each redundancy group will be 

of the appropriate disk. The controller also writes the unavailable in degraded mode, making it impossible to 

required identifying data to the meta-data section 38 of the correct data^arity inconsistencies even if they are known to 

log record 32. Only after the transfers have been success- exUt. For example, if drive D in the system 10 of FIG. 1 

fully completed does the controller 12 begin to transfer the 5 faik, then even if it is known that a data/parity inconsistency 

new data and the new parity information to the appropriate wrists in the redundancy group which iiicludes the PARITY 

data block and parity block, respectively, of the correspond- » "ock, it is impossible to restore consistency to the group 

ing redundancy group. After the new data and the new parity because the data which should be stored m BLOCK 2 is 

information have been successfully transferred to the redun- unknown. If the system 10 were not operatmg m degraded 

dancy group, the controUer can erase all of the associated lO " "'""^ only be necessaor to read and bit wise XOR 

information from the log record 32. In this way, the logging 'he data stored id BLOCKs 0, 1. 2, and 3 to calculate the 

area 30 wiU only contain information relating to potentially P^P" P^nty informaUon and hence restore consistency to 

corrupted redundancy groups. the group. . . . . , ... 

At any lime during the operation of the RAID system, an Another reason the mvenUon is of particiilar value dunng 
event may occur which interrupts the operation of the is degraded mode is because parity information is of height- 
system. This interruption event may inchide, for example, a ened importance dunng degraded mode. For example, in a 
power failure, a component failure, or a system reset. If the degraded system, if the host computer 14 requests that data 
event occurs during the performance of a write operation, ^e read from a data block which resides on a failed dnve, the 
non-consistencies may result between data and parity in the requested data can be generated by reading and processing 
airay. If the event occurs before the controUer 12 begins to 20 data from all of the other data blocks in the cotrespond- 
transfer the new dau and the new parity information to the i°e redundancy group and the panty information in that 
redundancy group then no inconsistencies will result group. If the data/panty information in the group is 
because the old data and the old parity information, which inconsistent, the requested data cannot be properly recov- 
are consistent, will still be in place. Similarly, if the event 'r^d. ^ . . , . 
occurs after the transfer of the new data and the new parity 25 Th-^ Pr^°' mvenuon is also of particular value in sys- 
information to the redundancy group is complete, no incon- 'e"^ which utilize redundant controller configurations. A 
sislencies will result because the new data and the new parity redundant controller configuration is simply a amflguration 
information is consistent. However, if the transfers to the which uses two or more controllers so that if one of the 
media are only parUally completed when the interruption controllers fails, the other one can take over without having 
event occurs (i.e., the new daU has been written to the media 30 ^° shut down the system. Because the log data is wntten to 
but the new parity has not, or vice versa), inconsistencies 'he disks m the array, the subordinate controUers have full 
wiU result between the data and parity information stored in access to the data for use in restoring data/parity consistency, 
the subject redundancy group. '° contrast to systems which store log data in a 

After the system 10 recovers from an interruption event, non-volatile memory in the controUer and, therefore, lose 

a re-initializalion procedure will be executed. During this 35 'he log data if the primary controller fails, 

reinitialization procedure, the system 10 will check aU of Although the present ipvenfon has been described m 

the logging areas 30 to determine whether log records 32 conjunction with its preferred embodiment, it is to be 

have been created which were not subsequenUy erased. If understood that modifications and variations may be 

such log records exist, the controller 12 wiU check each resorted to without departmg from the spmt and scope of the 

record to see if it is complete. If a record 32 is not complete, 40 invention as those skiUed in the art readily understand. For 

then the transfer of new data and new parity information to "'""Pl*. °f disk space reserved for loggmg of 

the redundancy group never began and non-consistencies information may reside on a single disk in the array, or any 

were not created. If a record 32 is complete, the controller combination of disks. Also, the mformaUon logged to the 

12 will read the meta-data section 38 of the log record and reserved disk area does not have to include the new data and 

transfer the corresponding new data and new parity infor- 45 'he new parity information. Instead, it may mclude the old 

mation to the appropriate locations in the array, thereby data and the old parity mformation or any other coinbiciaUon 

restoring consistency to the associated redundancy group. adequate to restore consistency to the subject redundancy 

In another embodiment of the present invention, the gro"P- Such modificaUons and vanations are considered to 

information logged to a log record 32 in logging area 30 he within the purview and scope of the invention and the 

does not have to be erased after the new data and the new so appended claims, 

parity information is successfully written to the appropriate ^hat is claimed is: 

redundancy group. Instead, the controller 12 creates another 1- ^ method for restoring consistency between data and 

log which lists the addresses of log records 32 which Parity information stored in a disk array subsystem using a 

correspond to potentially corrupted redundancy groups. Read-Modify-Write operation to said disk array, said 

After the controller 12 detects the initiation of an interrup- 5S "'«'hod compnsmg the steps of: 

Uon event, it examines the current drive activities to identify reserving a prunary log space on one of a plurality of disk 

unfinished write operations. It then deUvers the address of drives in said disk array wherein said primary log space 

the log record corresponding to each of the unfinished write is adapted to store log records comprising data and 

operations to the new log. At re-initialization, the controller parity information adequate to restore consistency of 

12 checks the new log and only performs transfers of new 60 data and parity in said disk array subsystem, wherein 

data and new parity for log records identified therein. said data and parity information is derived from receipt 

As described previously, the present invention has special of a write request; 

application in RAID systems operating in degraded mode. In writing said data and parity information to a log record in 

fact, in one embodiment of the present invention, informa- said primary log space in response to receipt of said 

tion is logged to the array of disk drives 16 only during 65 write request to said disk array; 

degraded mode operation. The reason the invention is of maintaining a secondary log to identify log records in said 

particular value during degraded mode operation is because primary log space; 
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executing said write request, in response to completion of 
said writing step, to store said data and parity infor- 
mation in said disk array subsystem in accordance with 
Read-Modify-Write operation techniques; 

determining that said step of executing said write request 
has not completed wherein the step of deteraaining 
includes the step of searching said secondary log to 
locate log records in said primary log space identifying 
incomplete write requests; and 

writing said data and parity information stored in said log 
record to said disk array in response to the determina- 
tion that execution of said write request has not com- 
pleted. 

2. The method of claim 1 further comprising the step of: 
erasing said log record in response to completing said step 

of executing said write request to said disk array 
subsystem in accordance with Read-Modify-Write 
operation techniques. 

3. The method of claim 1 further comprising the steps of: 
deleting entries in said secondary log in response to 

completing said step of executing said write request to 
said disk array subsystem in accordance with Read- 
Modify-Write operation techniques. 

4. The method of claim 1 wherein said step of writing said 
data and parity information to a log record comprises the 
step of: 

writing new data and new parity information to said log 
record, wherein said new data and new parity informa- 
tion are derived from said write request. 

5. The method of claim 1 wherein said step of writing said 
data and parity information to a log record comprises the 
step of: 

writing old data and old parity information to said log 
record, wherein said old data and old parity information 
are read from said disk array subsystem in response to 
receipt of said write request. 

6. The method of claim 1 wherein said disk array sub- 
system is operating in degraded mode. 

7. The method of claim 1 wherein said step of reserving 
a primary log space on one of a plurality of disk drives 
includes the step of: 

reserving a primary log space on each of said plurality of 
disk drives. 

8. A method for restoring consistency between data and 
parity information stored in a disk array subsystem follow- 
ing the interruption a Read-Modify-Write operation to said 
disk array, said method comprising the steps of: 

reserving a primary log space on one of a plurality of disk 
drives in said disk array wherein said primary log space 
is adapted to store log records comprising data and 
parity information adequate to restore consistency of 
data and parity in said disk array subsystem, wherein 
said data and parity information is derived from receipt 
of a write request; 

writing said data and parity information to a log record in 
said primary log space in response to receipt of said 
write request to said disk array; 

maintaining a secondary log to identify log records in said 
primary log space; 

executing said write request, in response to completion of 
said writing step, to store said data and parity infor- 
mation in said disk array subsystem in accordance with 
Read-Modify-Write operation techniques; 
determining that an interruption occurred during said 
step of executing said write request in accordance 
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with Read-Modify-Write operation techniques 
wherein the step of determining includes the step of 
searching said secondary log to locate log records in 
said primary log space identifying incomplete write 
requests; and 

writing said data and parity information stored in said log 
record to said disk array in response to the determina- 
tion that an interruption occurred during said step of 
executing said write request. 

9. The method of claim 8 further comprising the step of: 
erasing said log record in response to completing said step 

of executing said write request to said disk array 
subsystem in accordance with Read-Modify-Write 
operation techniques. 

10. The method of claim 8 further comprising the steps of: 
deleting entries in said secondary log in response to 

completing said step of executing said write request to 
said disk array subsystem in accordance with Read- 
Modify-Write operation techniques. 
U. The method of claim 8 wherein said step of writing 

said data and parity information to a log record comprises 

the step of: 

writing new data and new parity information to said log 
record, wherein said new data and new parity informa- 
tion are derived from said write request. 

12. The method of claim 8 wherein said step of writing 
said data and parity information to a log record comprises 
the step of: 

writing old data and old parity information to said log 
record, wherein said old data and old parity information 
are read from said disk array subsystem in response to 
receipt of said write request, 

13. The method of claim 8 wherein said disk array 
subsystem is operating in degraded mode. 

14. The method of claim 8 wherein said step of reserving 
a primary log space on one of a plurahty of disk drives 
includes the step of: 

reserving a primary log space on each of said plurality of 
disk drives. 

15. An apparatus for restoring consistency between data 
and parity information stored in a disk array subsystem 
following the interruption a Read-Modify-Write operation to 
said disk array, said apparatus comprising: 

a primary log space on one of a plurality of disk drives in 
said disk array wherein said primary log space is 
adapted to store log records comprising data and parity 
information adequate to restore consistency of data and 
parity in said disk array subsystem, wherein said data 
and parity information is derived from receipt of a write 
request; 

means for v^iting said data and parity information to a log 
record in said primary log space in response to receipt 
of said write request to said disk anay; 

a secondary log to identify log records in said primary log 
space; 

means for executing said write request, in response to 
completion of said writing means, to store said data and 
parity information in said disk array subsystem in 
accordance v^th Read-Modify-Write operation tech- 
niques; 

means for determining that an interruption occurred dur- 
ing said means of executing said write request in 
accordance with Read-Modify-Write operation tech- 
niques wherein said means for determining includes 
means for searching said secondary log to locate log 
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records is said primary log space identifying incom- 
plete write requests; and 
means for writing said data and parity information stored 
in said log record to said disk array in response to the 
determination that an interruption occurred during said 
means of executing said write request. 

16. The apparatus of claim 15 further comprising: 
means for erasing said log record in response to complet- 
ing said execution of said write request to said disk 
array subsystem in accordance with Rcad-Modify- 
Write operation techniques. 

17. The apparatus of claim 15 further comprising: 
means for deleting entries in said secondary log in 

response to completing said means of executing said 
write request to said disk array subsystem in accor- 
dance with Read-Modify- Write operation techniques. 

18. The apparatus of claim 15 wherein said means for 
writing said data and parity information to a log record 
comprises: 
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means for writing new data and new parity information to 
said log record, wherein said new data and new parity 
information are derived from said write request. 

19. The apparatus of claim 15 wherein said means for 
writing said data and parity information to a log record 
comprises: 

means for writing old data and old parity information to 
said log record, wherein said old data and old parity 
information are read from said disk array subsystem in 
response to receipt of said write request. 

20. Tlie apparatus of claim 15 wherein said disk array 
subsystem is operating in degraded mode. 

21. The method of claim 15 wherein said means for 
reserving a primary log space on one of a plurality of disk 
drives includes: 

means for reserving primary log space on each of said 
plurality of disk drives. 
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